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Abstract —We add to a line of work considering the impact 
of observation imperfections in models of Bayesian observational 
learning. In particular, we study a discrete-time model in which in 
each time-slot, an agent may randomly arrive. Agents who arrive 
have the opportunity to buy a given item. If an agent chooses 
to buy, this action is recorded for subsequent agents. However, 
the decisions of agents that choose not to buy are not recorded. 
Hence, if no one buys in a given slot, agents are unaware if this 
was due to no agent arriving or an agent choosing not to buy. 
We study the impact of this uncertainty on the emergence of 
information cascades. Using a Markov chain based analysis, we 
show that the probability of incorrect cascades and the expected 
time until a cascade happens are not monotonic in the arrival 
probability of a user. We find that adding a small uncertainty 
in the arrival information from the perfect information setting 
will make a buy cascade happen with higher probability than 
a not-buy cascade. However, if the agents’ private signals are 
weak, then a not-buy cascade is more likely to occur for most 
arrival rates, resulting in wrong cascades dominating when the 
item is good and vice-versa when the item is bad. 

I. Introduction 

A key feature of many on-line platforms is that they provide 
users with data about the actions of the other users such as 
how many users bought a given item or a summary of the 
users’ experiences via reviews. A subsequent user can then 
attempt to learn from this data about the action she should 
take, e.g., whether to buy an item or not. Such settings can be 
studied under the framework of Bayesian observational learn¬ 
ing, which has its roots in the economics literature (e.g. [2], 
[3]). In this framework, the agents are viewed as players in a 
dynamic game with incomplete information who form beliefs 
about the value of different actions based on observations 
of the actions of other players as well as their own private 
signals. In the simplest setting, these agents sequentially make 
a binary decision while perfectly observing the actions of all 
prior agents. A key result, first shown in [2] and [3], is that 
such models may exhibit information cascades, meaning that 
at some point an agent ignores their own private information 
about the item and chooses an action solely dependent on their 
type, which for homogenous agents results in blindly following 
the actions of the preceding agents. Moreover, for the models 
in [2] and [3], once a cascade starts, all subsequent agents also 
cascade, leading to herding. Though individually optimal, this 
may result in the agents making a choice that is not socially 
optimal, i.e., a “wrong cascade" where an item is bought even 
when it is not beneficial to do so or vice versa. 


Many variations of these types of models have been studied, 
such as differences in the types of signals received by each 
agent [4] or allowing agents to observe only a subset of past 
actions [6]. In [5] and [7], the agents can see only a random 
sample of past observations, which are of unknown order. In 
this paper, we instead assume that agents may randomly arrive 
in discrete time even though the order of arrivals is public 
information. In previous work [9], we considered another 
variation in which the arrivals were deterministic but there 
were random errors in the observations of the actions of other 
agents, hence imperfect observations. This led to the following 
counter-intuitive result: the probability of a wrong cascade is 
non-monotonic in the error level, i.e., in some cases, a higher 
error rate is beneficial. In this work, we consider a different 
way in which the observations of the agents may be imperfect. 

Our work is motivated by on-line platforms that report 
information on how many users have bought a given item, 
but do not specify how many users have considered buying 
the item and did not buy. Specifically, we consider a discrete¬ 
time model in which at each time-slot an agent randomly 
arrives with a given probability (and doesn’t arrive with the 
complementary probability). If an agent arrives it has the 
opportunity to either buy an item or not. If she chooses to 
buy, this action is recorded for all subsequent agents to see. 
On the other hand, if either no agent arrives in a certain epoch 
or one arrives and chooses not to buy, the subsequent agents 
will observe an “empty slot" in such an epoch. This introduces 
uncertainty in the observation history: if there is an empty slot, 
the agents are unsure as to whether it was because an agent 
chose not to buy or simply because no agent arrived. 

In our model, as in [2], [3], [9], all agents receive the same 
value from buying the given item, which is determined by 
an unknown binary state of the world that takes values from 
{Good, Bad}. If the state is Good, agents benefit from buying 
the item, while if it is Bad, agents are better off not buying. 
Each agent receives an independent, identically distributed 
binary signal about this state of the world. Our Bayes-rational 
agents then determine the a posteriori probability distribution 
of the state of the world given their private signal and 
observations, and choose to buy the item if their expected 
pay-off is greater from that action than from not buying. 

In our prior work [9], random errors occurred in the 
observations, where the error rate is assumed to be the same 
for each action choice (“buy" or “not buy"). This led to a 



symmetry in the model: an observation of an agent buying 
and not buying conveys the same amount of information if 
agents are not herding. In this paper, instead the uncertainty 
in the observations is asymmetric. A slot in which an agent 
arrives and buys conveys more information than an “empty" 
slot. Further, the information in an empty slot decreases as the 
arrival probability decreases. 

We show that as in [2], [3], [9], an information cascade 
will eventually occur with probability one, and once started, 
such a cascade will persist forever, i.e., the agents will herd. 
We then study the probability of a wrong cascade. To do this 
we utilize a Markov chain based analysis in which herding 
corresponds to absorbing states. Using this, we show that the 
probability of wrong/incorrect herding is not monotonic in 
the probability that an agent arrives in any time epoch. In 
some cases having a higher arrival probability (and thus more 
information conveyed in empty slots) increases the probability 
of a wrong cascade. We also discover that if the private signals 
are weak, the probability of a wrong cascade can be higher 
than that of a correct cascade. Moreover, we find that adding a 
small uncertainty will make a buy cascade happen with higher 
probability than a not-buy cascade due to a bias toward buy 
actions in the history. 

We organize the paper as follows. In Section II we specify 
our model. The model’s properties and an information theo¬ 
retic explanation are presented in Section III. The main results 
are presented in Section IV. We conclude in Section V. The 
technical details can be found in the archived version [10], 

II. Model 

We consider a variation of the model in [9] in which time 
is divided into discrete slots indexed n = 1,2,.... Different 
from [9], we assume that in each time slot n, an agent is 
present with a probability a £ [0,1].' If an agent is present 
at the time slot n, he chooses an action A n of either buying 
(Y) or not buying (N) a new item. The true value (V) of the 
item can be either good ( G ) or bad ( B) and is the same for 
all agents. For simplicity, both possibilities are assumed to be 
equally likely. 

The agents are Bayes-rational utility maximizers 1 2 whose 
payoff structure is based on the agent’s action and the true 
value V. If an agent chooses N, his pay-off is 0. On the other 
hand, if he chooses Y, he faces a cost of C = 1/2 and gains 
one of two amounts depending on the true value of the item: 
his gain is 0 if V = B and 1 if V = G. The total pay-off of 
an agent choosing Y is then the gain minus the cost. Thus, 
the ex ante expected pay-off of each agent is 0. 

To reflect the agents’ prior knowledge about V, if an 
agent arrives at time n, he receives a private signal S n £ 
{1 (high), 0 (low)} through a binary symmetric channel (BSC) 

1 'two special cases need highlighting: i) a = 0. where the model is 
degenerate with no incoming agents; and ii) a = 1, where the agents are 
always present and yields the model in [2] with symmetry in both states of 
the world. 

2 This assumption leads to a Perfect Bayesian Equilibrium that uses the 
common information based belief to determine strategies, as proved in [6]. 


with crossover probability 1 —p, where 0.5 < p < 1. In other 
words, we have 

P (S n = l\V = G)= P (S n = 0\V = B)=p, and 
P(S n = 0\V = G) = P{S n = l\V = B) = l~p. (1) 

Thus, the private signals are informative, but not revealing. 

Denote an observation at the time slot n as O n , we have 
either O n = Y when an agent is present at time n and chooses 
to buy the item, or O n = E denoting an empty slot if either 
there is no agent arriving at time n or there is one who chooses 
not to buy the item. Denote the observation history after time 
n as H n = {Oi,...,O n }. Similar to [9], we assume that 
H n is recorded via a common database that is available to all 
subsequent agents but without any further imperfections. 

III. Model properties and Bayesian updates 
In this section, we present the properties of our model. In 
particular, the observation history H n can be summarized by a 
sufficient statistic that is a weighted difference of the number 
of Ys and Es in the history, which we will elaborate on 
shortly. As in [9], both Y and N cascades are permanent. 
However, different from [9], there is an asymmetry in the type 
of observations. As the probability that an agent is present in 
any time slot, a, decreases, an empty slot O n = E conveys less 
information even though an observation ()„ = Y conveys the 
same amount of information. As a result, it requires a greater 
number of empty slots to create an N cascade as a decreases. 


A. Public likelihood ratio as a Markov process 

Let q = 1 —p. Similar to [9], agents’ decisions are based on 
Bayes updates of the posterior probability of V = B versus 
V = G given the observed history . However, due to the 
conditional independence of signals from the public history 
given the value of V, if present agent n+l can instead compare 
the public likelihood ratio, £ n , and his private belief, (3 n+ 1 , 
of V = B versus V = G. There is one subtle difference, 
however. In [8] and [9], £ n denotes the ratio after an agent 
n has decided. In this paper, £ n is the public likelihood ratio 
after the discrete time slot n, as an agent may not be present 
at time n. Since V being B or G is equally likely, £q = 1 and 
we can rewrite £ n and /3 n+ i as follows: 


P[H n \V = B] 
P[Hn\V=GY 


and fi n+ i 


P[S n+1 \V = B] 
P[5„ +1 |U = G]‘ 


( 2 ) 


The higher i n is, the more likely that V — B. Moreover, 
since H n is public information, if after slot n — la cascade 
does not happen, £ n can be updated as: 


£ 


n 


TE^/n-u if O n = E, 
§4-1, if On = y 


(3) 


Otherwise, if agent n — 1 cascades, then £ n = 4-i with 
probability 1. Given £ n , one can determine if an agent cascades 
or not. Thus, {£ n } is a Markov process. Moreover, this is 
also true, if in addition, we condition on the value of V. In 
addition, we can show that {£ n } (resp. {1/4}) is a martingale 



conditioned on V = G (resp. V = B ). On the other hand, 

Pn+i = q/p (resp. p/q) if S n+} = 1 (resp. S n+1 = 0). 

B. Agents’ decision rule and cascades’ condition 

By (3), any cascading action provides no information about 
V, and a cascade is permanent. Thus, conditioned on a cascade 
not occurring before time n — 1, let be a random variable 
denoting the number of Y actions in the history until n. In 
addition, let e n = n — denote the number of empty slots 
at time n in the history. We have the following lemma: 

Lemma 1. Let y = \og p/q ]Z// p G (0,1 ) for a G (0,1). Then: 

1) t n = ( q/p) hn , where the exponent h n = a n — ye n ; 

2) Conditioned on V, (a n ,e n ) and h n are 2-D and 1-D 
Markov chains, respectively, for n > 0; and 

2) Agent n + 1, if present, cascades Y if h„ > 1, cascades 
N if h n < — 1, and follows his signal if h n G [— 1,1] . 

Proof. 1) By (3), £ n = ( q/p) an [(1 — ap)/( 1 — aq )] e ", thus 
h n can be written in terms of a n and e n as above. 

2) This is a direct consequence of the fact that {£„} is a 
Markov process and that, from the first property, there is a 1-1 
correspondence between l n and h n . Further, since a n and e n 
are integer-valued it follows that h n only takes on a countable 
number of values. 

3) Since agent n + 1, if present, makes a decision by 

comparing £ n f3 n+ 1 to 1, she cascades Y if £ n < q/p , cascades 
N if £ n > p/q, and follows her signal if £ n G [q/p,p/q]. By 
1), this translates to the given condition on h n . □ 

Note that y is an indicator of how weak an empty slot is 
with respect to the signals. That is, the lower y is, the weaker 
the empty slots are relative to the signals. In addition, y is an 
increasing function of a, and y = 0 (resp. 1) if and only if a = 
0 (resp. 1). For a generic y, the dynamics of the process {£ n } 
can be studied by investigating the 2-D Markov chain (e„, a n ). 
However, for special values of y, this can be simplified. We 
will study a few of such scenarios in Section IV. 

Note that both Y and N cascades are permanent and the 
probabilities of reaching each cascade is positive. In addition, 
in contrast to [9] where Y and N observations in succession 
cancel each other out, here Y and E observations in succession 
leads to a bias towards a Y cascade. Hence, a sufficiently 
long sequence of Y and E pairs can also create a Y cascade. 
In Section IV, we present results on the probability of each 
cascade conditioned on the state of the world. 

C. An information theoretic view on the parameter y 

Recall that given a cascade has not occurred, the sufficient 
statistic is h n = a n — ye n , where we can rewrite y as: 

V = [log(l - aq) - log(l - ap)\ / [log(p) - log(g)]. (4) 

This is the exponent of the public likelihood ratio, £ n . More 
precisely, let L n = log £ n , be the public log-likelihood ratio, 
then we have 

Ln = K log (q/p) = -h n (log(p) - log(g)). (5) 


Next we give an information theoretic interpretation of these 
results. Following [1], we can express the public log-likelihood 
ratio given an observation sequence x n in terms of relative 
entropies (K-L divergenges) as follows: 

L n {x n ) = n [. D{P x n\P B ) - D(P xn \P a )] (6) 

where P x ™ is the empirical distribution of the observations, and 
Pg and Pb are the probability of an observation conditioned 
on V being equal to G and B , respectively. 

Given a public observation history at time n consisting of 
a n Y actions and e n empty slots, we have that n = a n + e n , 
P x n (y) = a a fi e and P x n (e) = a e f e . Moreover, we have 
that Pb(v ) = aq and P B {e) = 1 — aq. Likewise, Pc(y ) = ap 
and Pb(c) = 1 — ap. Using these in (6) gives 

t ( n\ i / a n /(a n + e n ). , s- n /[y n + e n ) 

L n (x n ) = a n log(-) +e„log(---) 

aq 1 — aq 

( , / a n /[a n -f- c n ). , / a n /{a n T e n ), \ 

- a n log(-) + e n log(---) 

\ ap 1 — ap J 

= a„ log (p/q) - e n log((l - aq)/{ 1 - ap)). 

Note the parameter y in (4) is exactly the ratio between the 
weights given to the a n and e n terms here and using this we 
have the relationship in (5), as expected. 

Using this we can further interpret this parameter as fol¬ 
lows. Let P e denote the empirical distribution given a single 
observation of an empty slot (i.e., this is simply a single atom 
on e). Then we have 

D{P e \P B ) - D(P e \Pa) = log(—!—) - log(—!—) (7) 

1 — aq 1 — ap 

= — l°g((l ~ aq)/ (1 — ap)). (8) 

Likewise, let P y denote the empirical distribution given a 
single observation of a Y action, so that 

D(P y \P B )-D{P y \P G ) = log(l/aq) — log(l/ap) = log (p/q). 

Comparing with the above we have that 

-y = [D(P e \P B ) - D(P e \P G )\ / [D(P y \P B ) - D(P y \P G )[ 

i.e., y gives the ratio of the value of observing a single empty 
slot to that from observing a single Y action. 

IV. Analysis and results 

In general, the underlying Markov chain is a 2-D chain 
with the states being the pairs (e n ,a n ), which denote the 
number of empty slots and the number of Y actions in the 
history. For rational values of y, the state space of the chain 
can be enumerated and this facilitates the analysis of the 
chain’s asymptotic properties. The following lemma further 
simplifies the 2-D chain to a 1-D chain with the states being 
h n = a n — ye n , and then enumerates these states. 

Lemma 2. Let y = j, where k, l are integers satisfy¬ 
ing gcd(k, l) = 1. Then the non-cascading state space of 
the underlying 1-D Markov chain is the finite set sY = 
i-1 -*=± — ij 




Figure 1: The 2-D MC for rational values y = k/l and V = G, 
where l = kb + c, c = 0,..., k — 1 , b = 1 , 2,... 


V = G, p = 0.51 



Figure 2: Wrong cascade probability versus a, the probability of an 
agent being present in each time slot, for V = G, p = 0.51. 


Given that h n € [—1,1], its values must belong to a subset 
of ,<//. Lemma 2 further states that the state space before a 
cascade happens is exactly .&/ for all rational values of y. 
Proof. See Appendix A in our archived version [10]. □ 

In Fig. 1, we show the state space and the transition 
dynamics of the underlying Markov chain for rational values 
of y = k/l. A related state space simplification was given 
for the model in [8]; however, a different approach is required 
here. In [8], the state transitions in a Y -cascade region follow a 
simple birth-death chain; this allows the recurrence equations 
to convert any states h n > 1 to a corresponding state in 
[—1,1]. In this paper, there is no birth-death chain. However, 
for any rational y the states will eventually repeat themselves 
in a structured way as shown in Fig. 1. We rely on this 
observation to simplify those states to a finite subset, based on 
which a finite set of linear equations can be written down and 
solved to get P and Er. In the remainder of this section, we 
provide closed-form expressions for P and Er for a few values 
of y when V = G. Note that for V = B, similar expressions 
can be derived and these are given in the appendices of our 
archived version [10]. 

A. Probability of wrong cascades 

For a rational value of y, we can use Lemma 2 to numeri¬ 
cally solve for and plot the wrong cascade probabilities, P. As 
a result, we can show that the probability of wrong/incorrect 
cascade is not monotonic in the probability that an agent 
arrives in any time epoch, as demonstrated in Fig. 2 for a 
low value of the signal quality (i.e., p = 0.51). The dashed 
lines in these figures show the corresponding probability when 
a = 1. There are points of discontinuity which happen when 
the y value changes. In addition, for a sufficiently low value 
of p = 0.51, as shown in Fig. 2, the probability of a wrong 
cascade can be higher than that of a correct cascade. Moreover, 
adding a small uncertainty will make a Y cascade happen with 
higher probability than a N cascade due to the bias toward Y 
actions in the history. Results for different values of p can be 
found in our archived version [10]. 

For special values of y, we can derive closed-form expres¬ 
sions of this probability, which give additional insights on the 
discontinuities seen in the numerical results. In particular when 
V = G, let a = ap denote the probability of having one 


more Y action in the history before a cascade happens. For 
any positive integer l, the following two propositions provide 
closed-form formulae for the probability of wrong cascade, P, 
when y = 1/1 and y = (l — l)/l, respectively. 

Proposition 1. Let y = l/l, where l is any positive integer. 
When V = G, the probability of wrong cascades, P, satisfies: 

P= (l-a) l+1 / [l-a(l-<*)'(! + /)] • (9) 


Proof. See Appendix B in our archived version [10]. □ 


Proposition 2. Let y = (l- 1)//, where the positive integer 
l > 3. 3 Let x = a(l — a). When V = G, the probability of 
wrong cascades, P, satisfies: 


(1 — a) 2 


1—X 


+ 2cr 2 (1 — a) _1 — 1 


1 — 4 a 1 1 (1 — a) 1 


GO) 


Proof. See Appendix C in our archived version [10]. □ 

For a fixed p, both Propositions 1 and 2 show that P 
decreases as a increases such that we always satisfy y = 
1/7, (7 — 1)//, respectively. In other words, P is increasing 
in l in Proposition 1 and is decreasing in l in Proposition 2. 
More insights into the discontinuities at a = 0 and a = 1 can 
be gained by studying the limits of P when y = 1 /l, (l — 1 )/l 
as l —> oo for a fixed p. In particular, in the first scenario, 
let l —> oo when y = 1//; this yields a f 0 + . Using 
Proposition 1, lim„m+P = - - -—, which is 

F p(logg-logp) . ’ 

p — q ' V q ) 


higher than the value of P = q when a = 0. In the second 
scenario, let l —> oo when y = [l — 1)//; this yields a f l - - 
Using Proposition 2, we have lim^ 


= aSktuai which is 

l-pq ’ 


lower than the value of P = 


<F+p- 


when a = 1. Therefore 


both scenarios again demonstrate discontinuities that happen 
at a = 0 and a = 1, as shown in Fig. 2. More importantly, 
both limits also show that a lower wrong cascade probability 
can be achieved if a is near one of these discontinuities by 


3 If l = 1 then y = 0, a = 0 and this reduces to a trivial case of no 
observation history. If / = 2, then y = 1/2 and one can refer to Proposition 
1 for an expression of P. 

4 This expression is a consequence of the wrong cascade probability in [2], 
[3], and in [9] for the special case where there are no observations errors (i.e., 
e — 0 .) 






























Figure 3: Expected time until a cascade happens versus a, for p = 
0.70. 


reducing it a small amount so that P moves to the left hand 
side of the discontinuity. In other words, agents can do better 
if a controlled amount of uncertainty is introduced through 
reducing the probability of arrival a. This observation applies 
for the case V = G. However, in contrast, when V = B, one 
can show that reducing uncertainty through increasing a leads 
to better outcomes. 


B. Expected time until a cascade happens, Er 

Using similar approach, one can numerically solve for the 
expected time until a cascade occurs, Er. The results are 
shown in Fig. 3 for p = 0.70. Depending on the range of a, 
different values of V £ {G, B } can lead to higher Er. There 
are points of discontinuity which happen as different values of 
a cause changes in the underlying state space. One interesting 
observation is that unlike the case V = G, for V — B the 
expected time Er is higher as p increases, given a fixed a. (See 
Figs. 22, 24, 26 in Appendix G in [10].) The reason being a 
higher signal quality leads to more agents having low signals; 
therefore a higher number of empty slots induces a longer time 
to N cascades. Furthermore for values of y £ {1 //, (l — 1)/(}, 
one again can write down the closed-form expressions for Er. 
These expressions are given in the next two propositions. 


Proposition 3. Let y = 1 /l, where l is any positive integer. 
When V = G, the expected time until a cascade, Er, satisfies: 


v 2 — (^ + 1)(1 — a ) 1 + (l — 1)(1 — a ) l+1 
T a[l-(l+l)a{l-a) 1 } ' ( } 


Proof. See Appendix D in our archived version [10]. □ 


Proposition 4. Let y = (l — 1 )/l, where the positive integer 
l > 3. Let x = a(l — a). When V = G, the expected time 
until a cascade, Er, satisfies: 


Er = 


2^+2(l-. 


1 — X 


+ 2a l 2 (1 — a) 1 


1 — 4 a 1 1 (1 — a) 1 


( 12 ) 


Proof. See Appendix E in our archived version [10], □ 

For a fixed p. Proposition 3 shows that Er is increasing in 
l (i.e., decreasing in a). Similarly, Proposition 4 shows that 
Er is decreasing in l (i.e., decreasing in a). More insights 
into the discontinuities at a = 1 can be gained by studying 


the limits of Er when y = (l — 1) /Z as Z —>■ oo for a fixed 
p. 5 In particular, let l —>• oo when y = (l — 1) /Z; this yields 
a t 1 _ - Using Proposition 4 gives lim^- Er = 2 + 2p l I fi l pq , 

which is lower than the value of Er = \ — when a = 1. 

1-2 pq 

Therefore, adding a little uncertainty leads to a faster expected 
time toward either type of cascades. Moreover, we can also 
show that lima-H- Er is not monotonically decreasing in p. 
This means when a little uncertainty is present, a higher signal 
quality does not necessarily lead to a faster time to a cascade. 
(See Fig. 17 in Appendix G in [10].) 

V. Conclusions and future work 

This paper considers Bayesian observational learning in 
which agents arrive in discrete time-slot and only one action 
(“buying”) is recorded. This introduces an asymmetry in the 
observation history as a slot in which an agent arrives and 
buys conveys more information than an “empty" slot. Further, 
the information in an empty slot decreases as the arrival 
probability decreases. We show that an information cascade 
will eventually occur with probability one, and once started 
such a cascade will persist forever, i.e., the agents will herd. 
We then utilize a Markov chain based analysis to study the 
probability of a wrong cascade and the expected time until 
a cascade happens. We show that both quantities are not 
monotonic in the probability that an agent arrives in any 
time epoch. In some cases, having a higher arrival probability 
(and thus more information conveyed in empty slots) leads 
to worse outcomes. Moreover, we also discover that if the 
private signals are weak, the probability of a wrong cascade 
can be higher than that of a correct cascade. In the future, 
we are interested in generalizations where agents also arrive 
randomly and leave reviews (as in [8]) that are observable (but 
not their actions). 
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5 Note that there is little interest in the limiting value of Er as a —»■ 0, 
since if y = l/l and we let l —>■ oo, then Er —>• oo, which can be verified 
from both Fig. 3 and equation (11). 














, n : 4fY actions 



Appendix A 
Proof of Lemma 2 

Let l = kb+c where c < k and (c, k) = 1. For simplicity of 
notation, the common denominator l of the accessible states 
is implicitly understood and we only consider the dynamics 
of the numerators. Therefore we reformulate s/ = {—l, —l + 
1,..., l — 1, 1} with the initial state h 0 = 0. If h n £ [— Z, Z], 
the next state is h n + X where X £ {—k, l}. If \h n \ > l, h n 
is absorbing where a Y or a N cascade happens and the next 
state is also h n . 

First, we show that all states in D = {— k, .1} are 
accessible. Denote the state —k as So where the subscript 
indicates the modulus of — k when dividing c. From a state 
Si, where i = 0,1,..., k — 1, one can access the state s i+1 
by adding l and then keep deducting k until reaching a state 
in [— k, —1]. Notice that now s i+1 mod k = c(i + 1). Since 
(fc, c) = 1, all the states s's are distinct and therefore occupy 
all values in the set D. 

Next, since one can deduct k from each state in D, all states 
in [— Z, —1] also accessible. The proof is completed by noting 
that one can add l to each state in [— Z, 0], making all states in 
s/ accessible. 


Appendix B 

Proof of Proposition 1 

If V = G and y = 1//, the underlying Markov chain is 
shown in Fig. 4. Using this, we can write down a system of 
linear equations to solve for the probability of wrong cascades, 
Po.o- Noting that P 0 ,i = (1 - a)Pi,i, Pi,o = (1 - a)P 2 ,o + 
aPi^i, P^i = (1 — a)P 2 ,i, we have 

Po.o = aP 0 ,i + (1 - a)Pi, 0 
= (1 - a) 1 Pi ;0 + al(l - a) 1 ?!,! 

= (1 - a) 2 P 2 ,o + a 2(1 - a) 2 P 2 ,i 

= (1 — a) l V li0 + al{ 1 — a)*P u 
= (1 - a) l+1 + a(l - a) z (l + Z)P 0 ,o 
m (1 - a) l+1 

^ 0,0 = l-a(l-a)'(l +1) 



Note that for V = B, the underlying Markov chain has the 
same state space. However, the transition probability where 
one more Y action is added is now aq (instead of ap as in 
when V = G.) In addition, the correct cascade states in the 
chain when V = G now become the wrong cascade states 
in the chain when V = B. Therefore, we can use a similar 
derivation to obtain the given formula for Po,o- 

Appendix C 

Proof of Proposition 2 

Similar to the case y = l/l, here we can rely on the Markov 
chain in Fig. 5 to write down a system of linear equations to 
solve for the probability of wrong cascades, Po,o- Note that 

Pi,o = aPi,i + (1 — a), we have 

Po.o = (1 — a)Pi,o + «Po,i 
= (1 - a)Pi,o + a(l - ct)Pi i i 
= (1 — a) 2 + 2a(l — a)Pi i i 
= (l-a) 2 + 2a(l-a) 2 P 2 .i 
= (1 - a) 2 [1 + 2a(l - a)] + 2a 2 (l - a) 3 P 3 , 2 


= (l-a ) 2 


it—2 


2 1 - [a(l-a)] _ x 


1 — a( 1 — a) 
+ 2a*- 2 (l-a) z - 1 P z _ liZ _2 


Now, substituting P;_ lj ;_2 = aP;_i^_r + (1 — a)P; i j_2, 
Pi- 1 , 1-1 = (l-a)P 0 ,0, Pi.i-2 = aP 0 ,0+(l—a), we obtain the 
claimed expression for Po.o as in Proposition 2. Note that for 
V = B, the underlying Markov chain has the same state space. 
However, the transition probability where one more Y action is 
added is now aq (instead of ap as in when V = G.) In addition, 
the correct cascade states in the chain when V = G now 
become the wrong cascade states in the chain when V = B. 







































Therefore, we can use a similar derivation to obtain the given 
formula for P 0 p. 

Appendix D 

Proof of Proposition 3 

Again, we can use the underlying Markov chain in Fig. 4 
to write down a system of linear equations and solve for the 
expected time until a cascade of either type happens, Erop. 
Noting that Erop = 1 + (1 — o;)Eti i i, Erip = 1 + (1 — 
a)Er2 l o+aEri ) i, Erip = 14-(1—a)Er 2 ,i, Er/p = l-fcdEr^i, 
and Er; i = Erop, we have 

Er 0 p = 1 + (1 — a)Eri j0 + aEr 0 ,i 
= 1 + a + (1 — a)Erip + a(l — a^Erpi 
= 1 4- (1 — ct) T ol 4- 2a(l — ct) 

+ (1 — a) 2 Er 2 ,o + 2a(l - a) 2 Er 2j i 
= 1 4- (1 — a) + (1 — a) 2 + a + 2ct(l — a) + 3a(l — a) 2 
+ (1 — a) 3 Er 3i o + 3a(l — a) 3 Er 3j i 


= 1 + (1 - a) + (1 - a) 2 + ... + (1 - a) 1 - 1 

T ex 4- 2a(l — ct) T 3ct(l — ct) 2 4~ • • • T (cr( 1 — o// 

+ (1 — ck)^Et/ 5 o + lct( 1 — a) l E Ti t i 

= 1 + (1 - a) + (1 - a) 2 + ... + (1 - a) 1 

T ct 4- 2a(l — ct) 4~ 3ct(l — ct) 2 T ... 4- Zcr( 1 — ct)^ 

4- (i 4- l)a(l - a) ( Ero,o 

_ 1 - (1 - a) l+1 l-(l + 1)(1 - a) l + l{ 1 - a) l+1 
a a 


4~ {l 4“ l)a(l — a) ; Er 0 p 

2 — (i 4- 1)(1 — a) 1 + (l — 1)(1 — a) l+1 


Etqp = 


a [1 — (l + l)a(l — a ) 1 } 
This is the given expression in Proposition 3. 


Appendix E 

Proof of Proposition 4 


Again, we can use the Markov chain in Fig. 5 to write down 
a system of linear equations to solve for the expected time 
until a cascade happens, Erop. Note that Er 3 p = 14-aErpi, 
Er 0: i = 1 4- (1 — a)Er 1 i, Et 1j:l = 1 4- (1 — a^Er^i, we have 

Erop = 1 4- (1 — a)Er 3 p 4- aEr 0 ,i 
= 1 4- a + (1 — a)Erip 4- a(l — a)Erip 
= 2 4- 2ct(l — a) + 2a(l — a) 2 Er 2j i 
= 2 + 2ct(l - a) + 2a(l - a) 2 4- 2ct 2 (l - a) 2 Er 2 , 2 
= 2 + 2a(l -a) + 2a(l - a) 2 + 2a 2 (l - a) 2 
+ 2a 2 (l - a) 3 Er 3j2 


V = G,p = 0.70 



Figure 6: Wrong cascade probability versus a, the probability of an 
agent being present in each time slot, for V = G, p = 0.70. 


Appendix F 

Closed-form expressoins for P and Et when V = B 


Here we provide the closed-form expressions of P and Er 
for the case V = B and y = 1/Z, (Z — 1 )/l. These can be 
derived from the corresponding expressions for the case V = 
G as given in Propositions 1-4. In particular, the expressions 
for P are obtained by replacing a = ap by a' = aq and 
then the results are deducted from 1. On the other hand, the 
expressions for Er are obtained by simply replacing a = ap 
by a' = aq. 

When y = l/l: 


P = l- 


(1 — a') l+l 

-a') l (l + l)’ 


03) 


Er = 


2 — (Z + 1)(1 - a') 1 + (Z- 1)(1 - a') l+1 


a 1 [1 — (Z 4- l)a'(l - a') 1 } 
When y = (l — 1 )/l: 


(1 — a') 2 2±^^ + 2a"- 2 (l-a , ) i - 1 - 


= 1 - 


1 — 4 a’ 1 3 (1 — a') 1 


(14) 


1 


05) 


„ 2^ + 2(1-aT~^+2a' l ~ 2 (1-a'Y _ 

Er l - Aa ' l -^(1 - a ') 1 ’ (16) 


where x' = a'(l — a'). 


Appendix G 
Additional Figures 


= 2(1 4- x 4- ••• 4- x l ~ 2 ) + 2(1 — a)(x + x 2 + ... + x l ~ 3 ) 
+ 2a l ~ 2 (l - a) l ~ 1 KTi_ij_ 2 . 

Now, substituting E;_ 1 ;_ 2 = l4-aEr;_ li j_i4-(l — a)Er; j_ 2 , 
Er = 1-|-(1 — a)Erop, Et;^_ 2 = l-t-ctErop, we obtain 
the claimed expression for Erop as in Proposition 4. 
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V = G, p = 0.99 



Figure 7: Wrong cascade probability versus a, the probability of an 
agent being present in each time slot, for V = G, p = 0.99. 


V = G,p = 0.51 



Figure 8: Expected time until a cascade happens (linear scale) versus 
a, for V = G, p = 0.51. 



Figure 9: Expected time until a cascade happens (log scale) versus 
a, for V = G,p = 0.70. 


V = G,p = 0.70 




Figure 11: Expected time until a cascade happens versus a, for p = 
0.99. The jumps at discontinuity are minimal for the case V = G. 
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Figure 12: Expected time until a cascade happens (linear scale) 
versus a, for V = G, p = 0.99. 


Figure 10: Expected time until a cascade happens (linear scale) 
versus a, for V = G, p = 0.70. 































V = G,p = 0.70, l = 1 : 1 : 500 



Figure 13: Comparing numerical and analytical results, for V = G, 
p = 0.70, y = 1/1. 
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Figure 14: Comparing numerical and analytical results, for V = G, 
p = 0.70,y = (l-l)/l. 

Appendix H 
Additional notes 

If we fix p: y = log p / 9 where {Zap i ncreases in 

therefore y increases as a increases. This means: 

1) If y = l/l, then as a increases, l decreases. 

2) If y = (l — 1 )/l, then as a increases, l increases. 

If we fix a: If p increases, then y decreases. 

1) If y = l/l, then as p increases, l increases. 

2) If r/ = (Z — 1 )/l, then as p increases, l decreases. 


V = G, p = 0.70, l = 1 : r : 500 



Figure 15: Comparing numerical and analytical results, for V = G, 
p = 0.70, y = l/l. 



Figure 16: Comparing numerical and analytical results, for V = G, 
P = 0.70, y = (l- 1 )/l. 


V = G 



Figure 17: Comparing Er between a = 1 versus a J. 1— for V = G, 
y = (l- 1 )/l. 



Figure 18: Wrong cascade probability versus a, the probability of 
an agent being present in each time slot, for V = B, p = 0.51. 



Figure 19: Wrong cascade probability versus a, the probability of 
an agent being present in each time slot, for V = B, p = 0.70. 























































Figure 20: Wrong cascade probability versus a, the probability of 
an agent being present in each time slot, for V = B, p = 0.99. 



Figure 21: Expected time until a cascade happens versus a, for p = 
0.51. 



Figure 22: Expected time until a cascade happens (linear scale) 
versus a, for V = B, p = 0.51. 



Figure 23: Expected time until a cascade happens (log scale) versus 
a, for V = B, p = 0.70. 


V = B,p = 0.70 



Figure 24: Expected time until a cascade happens (linear scale) 
versus a, for V = B, p = 0.70. 



Figure 25: Expected time until a cascade happens (log scale) versus 
a, for V = B, p = 0.99. 


v = B, p = 0.99 



Figure 26: Expected time until a cascade happens (linear scale) 
versus a, for V = B, p = 0.99. 





































